Database : Issued_Patents_AA:*
1: /cgn2_6/ptodata/1/iaa/5A_COMB.pep:*
2: /cgn2_6/ptodata/1/iaa/5B_COMB.pep:*
3: /cgn2_6/ptodata/1/iaa/6A_COMB.pep:*
4: /cgn2_6/ptodata/1/iaa/6B_COMB.pep:*
5: /cgn2_6/ptodata/1/iaa/PCTUS_COMB.pep:*
6: /cgn2_6/ptodata/1/iaa/backfiles1.pep:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES

                %
Result        Query
 No.   Score  Match  Length  DB  ID                   Description

   1     311  100.0     311   4  US-09-634-955B-5     Sequence 5, Appli
   2     311  100.0     311   4  US-09-816-760-5      Sequence 5, Appli
   3     311  100.0     311   4  US-09-838-561-5      Sequence 5, Appli
   4      52   16.7     311   4  US-09-838-561-15     Sequence 15, Appl
   5       9    2.9     682   4  US-09-107-532A-4499  Sequence 4499, Ap
   6       9    2.9    1170   4  US-09-462-136-6      Sequence 6, Appli
   7       8    2.6     113   3  US-08-905-223-439    Sequence 439, App
   8       8    2.6     261   4  US-09-328-352-7731   Sequence 7731, Ap
   9       8    2.6     263   4  US-09-270-767-46093  Sequence 46093, A
  10       8    2.6     299   4  US-09-902-540-14916  Sequence 14916, A
  11       8    2.6     310   4  US-09-907-794A-153   Sequence 153, App
  12       8    2.6     310   4  US-09-905-125A-153   Sequence 153, App
  13       8    2.6     310   4  US-09-902-775A-153   Sequence 153, App
  14       8    2.6     310   4  US-09-906-700-153    Sequence 153, App
  15       8    2.6     310   4  US-09-903-603A-153   Sequence 153, App
  16       8    2.6     310   4  US-09-904-920A-153   Sequence 153, App
  17       8    2.6     310   4  US-09-909-064-153    Sequence 153, App

Database : Issued_Patents_AA:*
1: /cgn2_6/ptodata/1/iaa/5A_COMB.pep:*
2: /cgn2_6/ptodata/1/iaa/5B_COMB.pep:*
3: /cgn2_6/ptodata/1/iaa/6A_COMB.pep:*
4: /cgn2_6/ptodata/1/iaa/6B_COMB.pep:*
5: /cgn2_6/ptodata/1/iaa/PCTUS_COMB.pep:*
6: /cgn2_6/ptodata/1/iaa/backfiles1.pep:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                %
Result        Query
 No.   Score  Match  Length  DB  ID                   Description

   1    1607  100.0     311   4  US-09-634-955B-5     Sequence 5, Appli
   2    1607  100.0     311   4  US-09-816-760-5      Sequence 5, Appli
   3    1607  100.0     311   4  US-09-838-561-5      Sequence 5, Appli
   4    1467   91.3     311   4  US-09-838-561-15     Sequence 15, Appl
   5   651.5   40.5     310   4  US-09-907-794A-153   Sequence 153, App
   6   651.5   40.5     310   4  US-09-905-125A-153   Sequence 153, App
   7   651.5   40.5     310   4  US-09-902-775A-153   Sequence 153, App
   8   651.5   40.5     310   4  US-09-906-700-153    Sequence 153, App
   9   651.5   40.5     310   4  US-09-903-603A-153   Sequence 153, App
  10   651.5   40.5     310   4  US-09-904-920A-153   Sequence 153, App
  11   651.5   40.5     310   4  US-09-909-064-153    Sequence 153, App
  12   651.5   40.5     310   4  US-09-905-381A-153   Sequence 153, App
  13   651.5   40.5     310   4  US-09-906-618-153    Sequence 153, App


Database : UniProt_03:*
1: uniprot_sprot:*
2: uniprot_trembl:*

Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.

SUMMARIES

                %
Result        Query
 No.   Score  Match  Length  DB  ID                   Description

   1      52   16.7     223   2  Q8CHS7               Q8chs7 mus musculu
   2       9    2.9     281   1  CG11_RAT             P97586 rattus norv
   3       9    2.9    1170   2  Q12200               Q12200 saccharomyc
   4       8    2.6      83   1  YORC_TTV1            P19287 thermoprote
   5       8    2.6     134   2  Q89T92               Q89t92 bradyrhizob
   6       8    2.6     137   2  Q6ZTJ1               Q6ztj1 homo sapien
   7       8    2.6     150   2  Q747K4               Q747k4 geobacter s
   8       8    2.6     216   2  Q72RN4               Q72rn4 leptospira
   9       8    2.6     216   2  Q8F428               Q8f428 leptospira
  10       8    2.6     226   2  Q8MT30               Q8mt30 drosophila
  11       8    2.6     233   2  Q9CHT7               Q9cht7 lactococcus
  12       8    2.6     235   2  Q6XHS1               Q6xhs1 drosophila
  13       8    2.6     237   2  Q6IDG8               Q6idg8 drosophila
  14       8    2.6     244   2  Q8D3B0               Q8d3b0 wiggleswort



Database : PIR_79:*
1: pir1:*
2: pir2:*
3: pir3:*
4: pir4:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                %
Result        Query
 No.   Score  Match  Length  DB  ID                   Description

   1       9    2.9    1170   2  S52525               probable membrane
   2       8    2.6     233   2  G86703               oxidoreductase ygc
   3       8    2.6     260   2  E70881               probable fabG5 pro
   4       8    2.6     261   2  G82082               conserved hypothet
   5       8    2.6     262   2  T34378               hypothetical prote
   6       8    2.6     293   2  T19954               hypothetical prote
   7       8    2.6     325   2  T17307               hypothetical prote
   8       8    2.6     341   2  D70761               probable fatty-acy
   9       8    2.6     371   2  A44122               alpha-helical coil
  10       8    2.6     938   2  T34105               hypothetical prote
  11       8    2.6     964   2  AG3433               diguanylate cyclas
  12       7    2.3     135   2  G89751               protein C33E10.10
  13       7    2.3     141   2  C97338               hydroxymyristoyl-(
  14       7    2.3     146   2  G97030               probable membrane
  15       7    2.3     148   2  F86899               hypothetical prote
  16       7    2.3     151   2  D75347               hypothetical prote
  17       7    2.3     157   2  C71477               probable ribityllu
  18       7    2.3     167   2  D82561               probable signal pe
  19       7    2.3     184   2  B72601               hypothetical prote
  20       7    2.3     198   2  B82080               flavodoxin [simila
  21       7    2.3     200   2  T42066               glutamate-ammonia


Database : PIR_79:*
1: pir1:*
2: pir2:*
3: pir3:*
4: pir4:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                %
Result        Query
 No.   Score  Match  Length  DB  ID                   Description

   1   640.5   39.9     325   2  T17307               hypothetical prote
   2   311.5   19.4     537   2  T34380               hypothetical prote
   3     307   19.1     262   2  T34378               hypothetical prote
   4   299.5   18.6     278   2  E83152               probable short-cha
   5     274   17.1     277   2  AG1983               hypothetical prote
   6   265.5   16.5     263   2  AI1692               oxidoreductase hom
   7   265.5   16.5     266   2  H97309               short-chain alcoho
   8   255.5   15.9     263   2  AH1321               oxidoreductase hom
   9     253   15.7     259   2  A69965               ketoacyl reductase
  10     249   15.5     259   2  B83838               oxidoreductase BH1
  11     245   15.2     293   2  AD2176               oxidoreductase all
  12     242   15.1     270   2  E86788               oxidoreductase yne
  13   241.5   15.0     328   2  F75374               probable ketoacyl
  14     241   15.0     247   2  F90254               hypothetical prote
  15     241   15.0     253   2  A95223               hypothetical prote
  16   238.5   14.8     287   2  C55210               hetN protein - Ana
  17   235.5   14.7     287   2  AF2475               ketoacyl reductase
  18   234.5   14.6     307   2  E70082               glucose 1-dehydrog
  19     234   14.6     261   1  A28788               actinorhodin polyk
  20     233   14.5     332   2  S37652               follicular lymphom


GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: March 12, 2005, 13:06:08 ; Search time 174 Seconds 

(without alignments) 
915.268 Million cell updates/sec 

Title: US-10-664-506-5

Perfect score: 1607

Sequence: 1 MGVMAMLMLPLLLLGISGLL...FFFAWACGVKEKLNVPEEG 311

Scoring table: BLOSUM62

Gapop 10.0, Gapext 0.5



Searched: 



1612378 seqs, 512079187 residues 



Total number of hits satisfying chosen parameters: 1612378 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0%

Maximum Match 100% 
Listing first 45 summaries 
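
The throughput figure in the run header can be checked against the other run parameters. The sketch below assumes that a "cell update" is one cell of the query-by-database dynamic-programming matrix, so the total is the query length times the number of residues searched; that definition is an assumption and is not stated in the report, and the function name is hypothetical.

```python
def cell_updates_per_second(query_length: int, db_residues: int, seconds: float) -> float:
    """Matrix cells filled per second, assuming one cell per (query residue, db residue) pair."""
    return query_length * db_residues / seconds


if __name__ == "__main__":
    # Values taken from the run header: 311-residue query,
    # 512079187 residues searched, search time 174 seconds.
    rate = cell_updates_per_second(311, 512_079_187, 174)
    print(f"{rate / 1e6:.3f} Million cell updates/sec")
```

Under that assumption the result agrees with the 915.268 Million cell updates/sec printed in the header.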

Database : UniProt_03:*
1: uniprot_sprot:*
2: uniprot_trembl:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

                %
Result        Query
 No.   Score  Match  Length  DB  ID                   Description

   1    1017   63.3     223   2  Q8CHS7               Q8chs7 mus musculu
   2   651.5   40.5     310   2  Q6UX59               Q6ux59 homo sapien
   3   651.5   40.5     325   2  Q9BTF9               Q9btf9 homo sapien
   4   650.5   40.5     325   2  Q6IAN0               Q6ian0 homo sapien
   5   640.5   39.9     325   2  Q9UFM6               Q9ufm6 homo sapien
   6   625.5   38.9     323   2  Q99J47               Q99j47 m similar t
   7     557   34.7     291   2  Q9Y3A1               Q9y3a1 homo sapien
   8     466   29.0     326   2  Q9Y140               Q9y140 drosophila
   9     458   28.5     317   2  Q7Q732               Q7q732 anopheles g
  10     331   20.6     311   2  Q65ZI3               Q65zi3 caenorhabdi
  11     322   20.0     264   2  Q81M93               Q81m93 bacillus an
  12     321   20.0     264   2  Q635E8               Q635e8 bacillus ce
  13     320   19.9     264   2  Q731G1               Q731g1 bacillus ce
  14   311.5   19.4     537   2  Q22787               Q22787 caenorhabdi
  15     309   19.2     264   2  Q6HE26               Q6he26 bacillus th



[Results 16-39: rows illegible in the source scan.]


  40     253   15.7     295   2  Q8RR58               Q8rr58 acinetobact
  41     252   15.7     295   2  P94129               P94129 acinetobact
  42   250.5   15.6     271   2  Q7UPD8               Q7upd8 rhodopirell
  43   249.5   15.5     311   2  Q9NYR8               Q9nyr8 homo sapien
  44     249   15.5     259   2  Q9KCR3               Q9kcr3 bacillus ha
  45     247   15.4     276   2  Q9ALU7               Q9alu7 enterobacte



ALIGNMENTS 



RESULT 1 
Q8CHS7 

ID Q8CHS7 PRELIMINARY; PRT; 223 AA. 

AC Q8CHS7; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created)

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-MAR-2004 (TrEMBLrel. 26, Last annotation update) 



OXLLtXXClX \.Oi -oO" px uVC'ifi . 



OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=C57BL/6J; TISSUE=Mammary gland;
RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;
RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S.,
RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E.,
RA Jones S.J., Marra M.A.;
RT "Generation and initial analysis of more than 15,000 full-length human
RT and mouse cDNA sequences.";
RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).
RN [2]
RP SEQUENCE FROM N.A.
RC STRAIN=C57BL/6J; TISSUE=Mammary gland;
RA Strausberg R.;
RL Submitted (NOV-2002) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the short-chain dehydrogenases/reductases
CC (SDR) family.
DR EMBL; BC039565; AAH39565.1; -.
DR GO; GO:0016491; F:oxidoreductase activity; IEA.
DR GO; GO:0008152; P:metabolism; IEA.
DR InterPro; IPR002198; ADH_short.
DR InterPro; IPR002347; Adh_short_C2.
DR Pfam; PF00106; adh_short; 1.
DR PRINTS; PR00081; GDHRDH.
DR PRINTS; PR00080; SDRFAMILY.
DR PROSITE; PS00061; ADH_SHORT; UNKNOWN_1.
KW Oxidoreductase.
SQ SEQUENCE 223 AA; 25088 MW; 368381EADF415B90 CRC64;

Query Match 63.3%; Score 1017; DB 2; Length 223; 

Best Local Similarity 90.7%; Pred. No. 4.2e-76; 

Matches 194; Conservative 7; Mismatches 13; Indels 0; Gaps 0; 
Qy 98 LDLS D I S CVPDVAKE VLDC YGCVD I L I NNAS VKVKG PAHKI SLELDKKI MDAN YFG P I TL 157 

HIMIMI : .1.1 1 1 i i : , ; : i i 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 ] 1 1 1 1 1 1 1 

Db 10 VDLSD I S CVQDVAKE VLDCYGCVD I L I NNAS VKVKG PAHKI SLELDKKI MDAN YFG P I TL 69 

Qy 158 TKALLPNMISRRTGQIVLVNNIQGKFGIPFRTTYAASKHAALGFFDCLRAEVEEYDWIS 217 

11 I Mil! I ill ! ! I !!!!!!! ill!!!!! !!!! !!! -! ! !!!!!!!•!!! ! ! U4 

Db 70 TKVLLPNMISRRTGQIVliWNIQAKPGIPFRTAYAASKHAVMGFFDCLRAEVEEYDVVVS 129 

Qy 218 TVS PTFI RS YHVYPEQGNWEAS I WKFFFRKLTYG VTiPVEVAEEVTVIRTWRKXQE 277 

llllhllll III Ml II III III U I II II h II 

Db 130 WSPTYIRSYRASPEQRNWETSICKFFCRKLAYGVHPVTIVAEEVMRTWRKKQ 189 

Qy 278 I PKAAVYVRTFFPEFFFAWACGVKEKLNVPEEG 311 

: I I II h ^ I II I I I I I I I I I II I II II I II I I I I 
Db 190 VPKAAVFIRTFFPEFFFAWACGVKEKLNVPEEG 223 
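
The two percentages reported for this result can be reproduced from the other figures in the report. The sketch below assumes that "Query Match" is the score divided by the perfect score (1607 for this query), and that "Best Local Similarity" is the number of matches divided by the total number of alignment columns (matches, conservative substitutions, mismatches and indels). Both definitions are inferred from the numbers, not taken from GenCore documentation, and the function names are hypothetical.

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Score as a percentage of the query's self-alignment ('perfect') score."""
    return 100.0 * score / perfect_score


def best_local_similarity(matches: int, conservative: int, mismatches: int, indels: int) -> float:
    """Identical positions as a percentage of all alignment columns (assumed definition)."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


if __name__ == "__main__":
    # RESULT 1 (Q8CHS7): Score 1017, perfect score 1607,
    # Matches 194; Conservative 7; Mismatches 13; Indels 0.
    print(f"Query Match           {query_match_percent(1017, 1607):.1f}%")
    print(f"Best Local Similarity {best_local_similarity(194, 7, 13, 0):.1f}%")
```

With the values for RESULT 1 these come out at 63.3% and 90.7%, the figures printed in the result header.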

RESULT 2 
Q6UX59 

ID Q6UX59 PRELIMINARY; PRT; 310 AA. 



AC Q6UX59; 

DT 05-JUL-2004 (TrEMBLrel. 27, Created)
DT 05-JUL-2004 (TrEMBLrel. 27, Last sequence update)
DT 05-JUL-2004 (TrEMBLrel. 27, Last annotation update)
DE DFIT212.
GN ORFNames=UNQ212;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=22887296; PubMed=12975309; DOI=10.1101/gr.1293003;
RA Clark H.F., Gurney A.L., Abaya E., Baker K., Baldwin D., Brush J.,
RA Chen J., Chow B., Chui C., Crowley C., Currell B., Deuel B., Dowd P.,
RA Eaton D., Foster J., Grimaldi C., Gu Q., Hass P.E., Heldens S.,
RA Huang A., Kim H.S., Klimowski L., Jin Y., Johnson S., Lee J.,
RA Lewis L., Liao D., Mark M., Robbie E., Sanchez C., Schoenfeld J.,
RA Seshagiri S., Simmons L., Singh J., Smith V., Stinson J., Vagts A.,
RA Vandlen R., Watanabe C., Wieand D., Woods K., Xie M.H., Yansura D.,
RA Yi S., Yu G., Yuan J., Zhang M., Zhang Z., Goddard A., Wood W.I.,
RA Godowski P.;
RT "The secreted protein discovery initiative (SPDI), a large-scale
RT effort to identify novel human secreted and transmembrane proteins: a
RT bioinformatics assessment.";
RL Genome Res. 13:2265-2270(2003).
CC -!- SIMILARITY: Belongs to the short-chain dehydrogenases/reductases
CC (SDR) family.
DR EMBL; AY358498; AAQ88862.1; -.
DR GO; GO:0016491; F:oxidoreductase activity; IEA.
DR GO; GO:0008152; P:metabolism; IEA.
DR InterPro; IPR002198; ADH_short.
DR InterPro; IPR002347; Adh_short_C2.
DR Pfam; PF00106; adh_short; 1.
DR PRINTS; PR00081; GDHRDH.
DR PRINTS; PR00080; SDRFAMILY.
DR PROSITE; PS00061; ADH_SHORT; UNKNOWN_1.
KW Oxidoreductase.
SQ SEQUENCE 310 AA; 33524 MW; 5BE703478EE20BD7 CRC64;



Query Match 40.5%; Score 651.5; DB 2; Length 310; 

Best Local Similarity 43.5%; Pred. No. 1.2e-45; 

Matches 138; Conservative 59; Mismatches 89; Indels 31; Gaps 6;

Qy 3 VMAMLMLPLLL - - LG I SGLLF I YQEVSRLWS KSAVQNKVWI TDA I SGLGKECARVFHTG 60 

• Hill Ih II : I I h ::| Mill I I III III h I h 

Db 4 I TSTAI LPLLFGCLGVFGLFRLLQWVR GKAYLRNAWVI TGATSGLGKECAKVFYAA 60 

Qy 61 GARLVLCGKNWERLENLYDAL- 1 SVADPSKTFTPKLVLLDLSDI SCVPDVAKEVLDCYGC 119 

Ihllllhl III III : I III Ihl I hi hi 

Db 61 GAKLVLCGRNGGAJjEELI RELTASHATKVQTHKP YLVTFDLTDSGAI VAAAAE I LQCFGY 120 

Qy 120 VDI LI NNAS VKVKGPAHKI SLELDKKI MDANYFGP I TLTKALLPNM I SRRTGQI VLVNN I 179 

liri=lll : =1 ::::||::|: llllh |||||||:|| || | || :::| 

Db 121 VD I LVNNAG I S YRGT I MDTTVDVDKR VM ETN YFG P VALTKALLPSM I KRRQGH I VA I S S I 180 

Qy 180 QGKFG I PFRTTYAAS KHAALGFFDCLRAEVEE YDWI STVS PTF I R SYHVYPEQGNW 236 



Ill lllh lllllll lllllllhhh: == M :| I = I 

Db 181 QGKMS I PFRSAYAASKHATQAFFDCLRAEMEQYEI EVTVI SPGYI HTNLSVNAI TADGS - 239 



Qy 237 EAS I WKFFFRKLTYGV HPVEVAEEVMRTVRRKKQEVFMANPI PKAAVYVRT 287 

III llll|::h I :||::| :h U llhll 

Db 240 R YGVMDTTTAQGRS PVEVAQDVLAAVGKKKKDVI LADLLPSLAVYLRT 287 

Qy 288 FFPEFFFAWACGVKEK 304 

I lh-1 — 
Db 288 LAPGLFFSLMASRARKE 304 
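
The database entries shown with each result use the flat-file convention of a two-letter line code (ID, AC, DT, OS, OC, RA, DR, KW, SQ and so on) followed by the line's text. A minimal sketch of collecting those lines by code is given below; the function name is hypothetical, the single-space separator follows the way the lines appear in this listing, and the sample lines are copied from the Q6UX59 entry above.

```python
from collections import defaultdict


def parse_flatfile(lines):
    """Group flat-file lines by their two-letter, upper-case line code."""
    fields = defaultdict(list)
    for line in lines:
        code, _, text = line.strip().partition(" ")
        # Keep only lines that start with a two-letter upper-case code and
        # carry text; alignment lines ('Qy', 'Db') and bare 'XX' are skipped.
        if len(code) == 2 and code.isalpha() and code.isupper() and text:
            fields[code].append(text.strip())
    return dict(fields)


sample = """ID Q6UX59 PRELIMINARY; PRT; 310 AA.
AC Q6UX59;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
KW Oxidoreductase.
Qy 288 FFPEFFFAWACGVKEK 304""".splitlines()

record = parse_flatfile(sample)

if __name__ == "__main__":
    for code, values in record.items():
        print(code, values)
```

Repeated codes such as OC and RA are kept as lists so that multi-line fields can be rejoined later if needed.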



RESULT 3 
Q9BTF9 

ID Q9BTF9 PRELIMINARY; PRT; 325 AA. 

AC Q9BTF9;
DT 01-JUN-2001 (TrEMBLrel. 17, Created)
DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update)
DT 25-OCT-2004 (TrEMBLrel. 28, Last annotation update)
DE DKFZP566O084 protein.
GN Name=DKFZp566O084;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Lung;
RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;
RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S.,
RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E.,
RA Jones S.J., Marra M.A.;
RT "Generation and initial analysis of more than 15,000 full-length human
RT and mouse cDNA sequences.";
RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).
RN [2]
RP SEQUENCE FROM N.A.
RC TISSUE=Lung;
RA Strausberg R.;

RL Submitted (MAR-2001) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Lung; 



Database : A_Geneseq_16Dec04:*
1: geneseqp1980s:*
2: geneseqp1990s:*
3: geneseqp2000s:*
4: geneseqp2001s:*
5: geneseqp2002s:*
6: geneseqp2003as:*
7: geneseqp2003bs:*
8: geneseqp2004s:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                %
Result        Query
 No.   Score  Match  Length  DB  ID                   Description

[Results 1-7: rows incomplete in the source. Legible entries: AAB47593; ABP58046 (Human deh); AAB08948 (Human sec); ADM33425 (Human PRO); AAM24151 (Human EST).]


   8      70   22.5      81   8  ABO54789             Abo54789 Human gen
   9      52   16.7     311   6  ABP58047             Abp58047 Mouse deh
  10      52   16.7     311   8  ADG40201             Adg40201 Mouse deh
  11      29    9.3      29   3  AAB08949             Aab08949 Human sec
  12       9    2.9      31   6  ADA98202             Ada98202 Human sec
  13       9    2.9      31   6  ADA44050             Ada44050 Human sec
  14       9    2.9      31   7  ADC20368             Adc20368 Human sec
  15       9    2.9      31   7  ADF10701             Adf10701 Human sec
  16       9    2.9      32   4  AAB60717             Aab60717 Human sec
  17       9    2.9     281   2  AAW38423             Aaw38423 Rat cell
  18       9    2.9     281   7  ADE62861             Ade62861 Rat Prote



RESULT 4 
AAB08911 

ID AAB08911 standard; protein; 311 AA. 
XX 

AC AAB08911; 
XX 

DT 30-AUG-2000 (first entry) 
XX 

DE Human secreted protein sequence encoded by gene 21 SEQ ID NO: 68. 
XX 

KW Human; secreted protein; cytostatic; anti-proliferative; vulnerary;
KW immunosuppressive; antibacterial; diagnosis; immune system; chemotaxis;
KW hyperproliferative disorder; infectious disease; tissue regeneration;
KW screening; food additive; preservative; wound healing;
KW hyper-vascular disease; chromosome 11.

XX 

OS Homo sapiens. 
XX 

PN WO200017222-A1. 
XX 

PD 30-MAR-2000.
XX

PF 22-SEP-1999; 99WO-US022012.
XX

PR 23-SEP-1998; 98US-0101546P.

PR 02-OCT-1998; 98US-0102895P.
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Ruben SM, Rosen CA, Duan RD, Shi Y, Lafleur DW, Young PE, Ni J; 

PI Komatsoulis G, Endress GA, Soppet DR; 

XX 

DR WPI; 2000-283538/24. 

DR N-PSDB; AAA39072. 
XX 

PT Human secreted proteins and coding sequences useful in diagnostic and 

PT therapeutic methods for disorders such as immune system or proliferative 

PT disorders, related to the proteins. 
XX 

PS Claim 11; Page 366-367; 416pp; English. 
XX 

CC The polynucleotide sequences given in AAA39052 to AAA39088 encode the 

CC human secreted proteins given in AAB0889X to AAB08S84. The human secreted

CC proteins can have activities based on the tissues and cells they are 

CC expressed in. Examples of the activities are: cytostatic; anti- 

CC proliferative; immunosuppressive; antibacterial; and vulnerary. The 

CC secreted proteins and their related polynucleotide sequences are useful 

CC for diagnostic and therapeutic methods useful for diagnosing and treating 

CC disorders related to the secreted proteins. The proteins, and 

CC polynucleotide sequences may be useful for treating disorders of the 

CC immune system, hyperproliferative disorders, infectious disease,

CC regeneration of tissues, for chemotaxis and for screening molecules that 

CC bind to the proteins. The proteins or polynucleotide sequences may be 

CC used as food additives or preservatives, to increase or decrease storage 

CC capabilities, fat content, lipid, protein, carbohydrate, vitamins, 

CC minerals, co-factors or other nutritional components. Agonists or 

CC antagonists of the proteins may be used to prevent scar tissue growth 



CC during wound healing, and hyper-vascular diseases. AAA39043 to AAA39051 

CC and AAB08890 are sequences used in the exemplification of the present 

CC invention 
XX 

SQ Sequence 311 AA; 

Query Match 82.0%; Score 255; DB 3; Length 311; 

Best Local Similarity 100.0%; Pred. No. 2e-231; 

Matches 255; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 MGVMAMLMLPLLLLGISGLLFIYQEVSRLWSKSAVQNKVVVITDAISGLGKECARVFHTG 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MGVMAMLMLPLLLLGISGLLFIYQEVSRLWSKSAVQNKVVVITDAISGLGKECARVFHTG 60

Qy  61 GARLVLCGKNWERLENLYDALISVADPSKTFTPKLVLLDLSDISCVPDVAKEVLDCYGCV 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 GARLVLCGKNWERLENLYDALISVADPSKTFTPKLVLLDLSDISCVPDVAKEVLDCYGCV 120

Qy 121 DILINNASVKVKGPAHKISLELDKKIMDANYFGPITLTKALLPNMISRRTGQIVLVNNIQ 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 DILINNASVKVKGPAHKISLELDKKIMDANYFGPITLTKALLPNMISRRTGQIVLVNNIQ 180

Qy 181 GKFGIPFRTTYAASKHAALGFFDCLRAEVEEYDVVISTVSPTFIRSYHVYPEQGNWEASI 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 GKFGIPFRTTYAASKHAALGFFDCLRAEVEEYDVVISTVSPTFIRSYHVYPEQGNWEASI 240

Qy 241 WKFFFRKLTYGVHPV 255
       |||||||||||||||
Db 241 WKFFFRKLTYGVHPV 255


Database : Published_Applications_AA:*
1: /cgn2_6/ptodata/1/pubpaa/US07_PUBCOMB.pep:*
2: /cgn2_6/ptodata/1/pubpaa/PCT_NEW_PUB.pep:*
3: /cgn2_6/ptodata/1/pubpaa/US06_NEW_PUB.pep:*
4: /cgn2_6/ptodata/1/pubpaa/US06_PUBCOMB.pep:*
5: /cgn2_6/ptodata/1/pubpaa/US07_NEW_PUB.pep:*
6: /cgn2_6/ptodata/1/pubpaa/PCTUS_PUBCOMB.pep:*
7: /cgn2_6/ptodata/1/pubpaa/US08_NEW_PUB.pep:*
8: /cgn2_6/ptodata/1/pubpaa/US08_PUBCOMB.pep:*
9: /cgn2_6/ptodata/1/pubpaa/US09A_PUBCOMB.pep:*
10: /cgn2_6/ptodata/1/pubpaa/US09B_PUBCOMB.pep:*
11: /cgn2_6/ptodata/1/pubpaa/US09C_PUBCOMB.pep:*
12: /cgn2_6/ptodata/1/pubpaa/US09_NEW_PUB.pep:*
13: /cgn2_6/ptodata/1/pubpaa/US10A_PUBCOMB.pep:*
14: /cgn2_6/ptodata/1/pubpaa/US10B_PUBCOMB.pep:*
15: /cgn2_6/ptodata/1/pubpaa/US10C_PUBCOMB.pep:*
16: /cgn2_6/ptodata/1/pubpaa/US10D_PUBCOMB.pep:*
17: /cgn2_6/ptodata/1/pubpaa/US10_NEW_PUB.pep:*
18: /cgn2_6/ptodata/1/pubpaa/US11_NEW_PUB.pep:*
19: /cgn2_6/ptodata/1/pubpaa/US60_NEW_PUB.pep:*
20: /cgn2_6/ptodata/1/pubpaa/US60_PUBCOMB.pep:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                %
Result        Query
 No.   Score  Match  Length  DB  ID                   Description

   1     311  100.0     311   9  US-09-838-561-5      Sequence 5, Appli
   2     311  100.0     311   9  US-09-816-760-5      Sequence 5, Appli
   3     311  100.0     311  14  US-10-172-585-5      Sequence 5, Appli
   4     311  100.0     311  15  US-10-144-433-2      Sequence 2, Appli
   5     311  100.0     311  15  US-10-664-506-5      Sequence 5, Appli
   6     311  100.0     311  15  US-10-144-433-2      Sequence 2, Appli
   7     255   82.0     311   9  US-09-820-893-68     Sequence 68, Appl
   8     255   82.0     311  15  US-10-607-565-68     Sequence 68, Appl
   9     255   82.0     345   9  US-09-820-893-106    Sequence 106, App
  10     255   82.0     345  15  US-10-607-565-106    Sequence 106, App



RESULT 1
AI091419/C
LOCUS       AI091419     631 bp    mRNA    linear   EST 30-OCT-1998
DEFINITION  ow62e03.x1 Soares_NSF_F8_9W_OT_PA_P_S1 Homo sapiens cDNA clone
            IMAGE:1651420 3' similar to WP:T25G12.7 CE07544 DEHYDROGENASE
            /contains TAR1.t3 MER22 repetitive element ;, mRNA sequence.
ACCESSION   AI091419
VERSION     AI091419.1  GI:3430478
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 631)
  AUTHORS   NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap.
  TITLE     National Cancer Institute, Cancer Genome Anatomy Project (CGAP),
            Tumor Gene Index
  JOURNAL   Unpublished (1997)
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            This clone is available royalty-free through LLNL; contact the
            IMAGE Consortium (info@image.llnl.gov) for further information.
            Insert Length: 1044 Std Error: 0.00
            Seq primer: -40m13 fwd. ET from Amersham
            High quality sequence stop: 453.
FEATURES             Location/Qualifiers
     source          1..631
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:1651420"
                     /lab_host="DH10B"
                     /clone_lib="Soares_NSF_F8_9W_OT_PA_P_S1"
                     /note="Organ: pooled; Vector: pT7T3D-Pac (Pharmacia) with
                     a modified polylinker; Site_1: Not I; Site_2: Eco RI;
                     Equal amounts of plasmid DNA from five normalized
                     libraries were mixed, and ss circles were made in vitro.
                     Following HAP purification, this DNA was used as tracer in
                     a subtractive hybridization reaction. The driver was
                     PCR-amplified cDNAs from pools of 5,000 clones made from
                     the same 5 libraries. The pools consisted of the following
                     libraries and cloneIDs: Soares NbHSF pool 1:
                     3053S4-310919, 323208-325895 Soares Nb2HP pool 1:
                     145032-147335, 147720-148103, 148872-149255, 15002-
                     150407, 151176-152327 Soares Nb2HF8-9W pool 1:
                     758280-760583, 772104-774407 Soares NbHPA pool 1:
                     304776-306311, 320136-322823, 326280-326663 Soares NbHOT
                     pool 1: 723720-726407, 739080-740999 Subtraction by Bento
                     Soares and M. Fatima Bonaldo."
ORIGIN



Query Match 45.0%; Score 621; DB 1; Length 631;
Best Local Similarity 100.0%; Pred. No. 5.9e-294;
Matches 621; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy 



698 TC^TCAACAATGCCAGTGTGAAGG 757 

MINIM MINI MINI IMIIIIIIIIMI II III III II II INI II Mill III 



UD 


COT 


TraTra apa a npp pp a P r rp r rp a AnPTPA a pppppptpppp a t 1 a apa t" u t u t , ptpt i pp a p ptpp 
1 Uii LAAL-AAlbLL!ALx lb IuAALHj IbAALibobLL. IbLLLA 1 AALiA 1 1 iLlL lbbAbL 1 Lb 




Qy 


758 


ACAAAAAGATCATGGATGCC^TTACTTTGGCCCCATCACA 


817 






' 1: ! 1 . 1 1 1 ' 1 1 1 1 




UD 


DDI 


a pa a a a z\PATPaTTir , 7\Tr , pr i a atts ptttpppppp a tp a pATTY^ArrA a rppppty^pttp 
ALAAAAALiA 1 LA IbbA IbLL-AA 1 1 AL I 1 1vjoL.UL.L-A 1 LA LA 1 IbALbAAAbLLL IbL 1 1L 


c P* o 


Qy 


818 


CCAAC^TGATCTCCCGGAGAACAGGCCAAATCGTGTTAGTGAATAATATCCAAGGGAAGT 


877 






IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIII 




UD 




OP A A pa TP A tptpppyip apa apaP-PPPA a a tpv'tptta ptp a ata atatppa Appniv nprn 
LLAALA IbA 1L1 L L LbLiAuAA LALxLj L LAAA i Lb lb 1 1 Ab IbAA 1 AA 1 A 1 LLIAAbbbAAlj 1 


/i A O 


Qy 


878 


TTGGAATCCCGTTCCX^ 


937 






1 1 1 1 1 1 1 1 1 II II 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 II ! 1 1 II 1 1 1 1 1 II 1 II II II 1 1 1 II 1 1 1 1 




UD 




ttppa a tpppp ttpppt a pp a ptt a nnr tr vr > nr ir vr t r i Ts. appa pppa pppptppppttptttp 
1 IbbAAlLLLbl ILLbiALliALl 1 ALUL IbLL 1 LbAALibALLiLAULLL lbbUL 1 1L1 1 lb 


J oz 


Qy 


938 


ACTGCCTCCGAGCCGAAGTGGAGGAATACGATGTTGTCATC^ 


997 






MM III III III 1 Mill III lllllllllll II 1 Mill II III II II llllll II II 




UD 




a ptp pptppp appppa aPTPPapp a at a py* -1 atpttptpa tpappa p pp tp appp pp a ptt 
AL IbLL, 1 LLbALiLLbAAb IbLiALrbAA 1 ALbA lb 1 lb 1 vJA i LALj LA L Lb IbAuLLLbAL 1 i 


o o o 


Qy        998 TCATCCGGTCGTACCACGTGTATCCAGAGCAAGGAAACTGGGAAGCTTCCATTTGGAAAT 1057
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        321 TCATCCGGTCGTACCACGTGTATCCAGAGCAAGGAAACTGGGAAGCTTCCATTTGGAAAT 262

Qy       1058 TCTTTTTCAGGAAGCTGACCTACGGCGTGCACCCAGTAGAGGTGGCGGAGGAGGTGATGC 1117
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        261 TCTTTTTCAGGAAGCTGACCTACGGCGTGCACCCAGTAGAGGTGGCGGAGGAGGTGATGC 202

Qy       1118 GCACCGTGCGGAGGAAGAAGCAAGAGGTGTTTATGGCCAACCCCATCCCCAAGGCCGCCG 1177
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        201 GCACCGTGCGGAGGAAGAAGCAAGAGGTGTTTATGGCCAACCCCATCCCCAAGGCCGCCG 142


Qy       1178 TGTACGTCCGCACCTTCTTCCCGGAGTTCTTTTTCGCCGTGGTGGCCTGTGGGGTGAAGG 1237
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        141 TGTACGTCCGCACCTTCTTCCCGGAGTTCTTTTTCGCCGTGGTGGCCTGTGGGGTGAAGG 82


Qy       1238 AGAAGCTCAATGTCCCGGAGGAGGGGTAACTGCAGGAGGCCAAATGGGCCACCCCTTGGA 1297
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         81 AGAAGCTCAATGTCCCGGAGGAGGGGTAACTGCAGGAGGCCAAATGGGCCACCCCTTGGA 22

Qy       1298 AATAAAGGTTTTTCTGGCAAA 1318
              |||||||||||||||||||||
Db         21 AATAAAGGTTTTTCTGGCAAA 1
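The Db coordinates in RESULT 1 count down (141, 81, 21, ending at 1) while the Qy coordinates count up to 1318, which is what one would expect for a hit on the complementary strand of the database entry (the /C suffix on the entry name). A minimal Python sketch of that coordinate bookkeeping follows. It assumes an ungapped alignment, as the header reports (Indels 0; Gaps 0), and anchors on the final aligned pair Qy 1318 / Db 1. The function name `db_position` is hypothetical and is not part of the search software.

```python
def db_position(qy_pos, qy_anchor=1318, db_anchor=1):
    """Map a query position to its database position for an ungapped
    complement-strand alignment: Db decreases by one as Qy increases by one.

    The default anchor pair (Qy 1318, Db 1) is the last aligned column
    of RESULT 1 as printed in the report.
    """
    return db_anchor + (qy_anchor - qy_pos)


# Start positions of the last three alignment rows of RESULT 1
for qy in (1178, 1238, 1298):
    print(qy, db_position(qy))
```

Under these assumptions the first aligned query base (698) maps to Db 621, which agrees with the 621 matches reported in the header.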





RESULT 2 

AI741629/C 

LOCUS       AI741629      632 bp    mRNA    linear   EST 19-DEC-1999
DEFINITION  wg28f07.x1 Soares_NSF_F8_9W_OT_PA_P_S1 Homo sapiens cDNA clone
            IMAGE:2366437 3' similar to WP:T25G12.7 CE07544 DEHYDROGENASE
            ;contains TAR1.t3 MER22 repetitive element ;, mRNA sequence.
ACCESSION   AI741629
VERSION     AI741629.1  GI:5109917
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 632)
  AUTHORS   NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap.
  TITLE     National Cancer Institute, Cancer Genome Anatomy Project (CGAP),
            Tumor Gene Index
  JOURNAL   Unpublished (1997)
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            This clone is available royalty-free through LLNL ; contact the
            IMAGE Consortium (info@image.llnl.gov) for further information.
            Insert Length: 1035 Std Error: 0.00
            Seq primer: -40UP from Gibco
            High quality sequence stop: 444.
FEATURES             Location/Qualifiers
     source          1..632
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:2366437"
                     /lab_host="DH10B"
                     /clone_lib="Soares_NSF_F8_9W_OT_PA_P_S1"
                     /note="Organ: pooled; Vector: pT7T3D-Pac (Pharmacia) with
                     a modified polylinker; Site_1: Not I; Site_2: Eco RI;
                     Equal amounts of plasmid DNA from five normalized
                     libraries were mixed, and ss circles were made in vitro.
                     Following HAP purification, this DNA was used as tracer in
                     a subtractive hybridization reaction. The driver was
                     PCR-amplified cDNAs from pools of 5,000 clones made from
                     the same 5 libraries. The pools consisted of the following
                     libraries and cloneIDs: Soares NbHSF pool 1:
                     309384-310919, 323208-325895 Soares Nb2HP pool 1:
                     145032-147335, 147720-148103, 148872-149255, 15002 -
                     150407, 151176-152327 Soares Nb2HF8-9W pool 1:
                     758280-760583, 772104-774407 Soares NbHPA pool 1:
                     304776-306311, 320136-322823, 326280-326663 Soares NbHOT
                     pool 1: 723720-726407, 739080-740999 Subtraction by Bento
                     Soares and M. Fatima Bonaldo."
ORIGIN



Query Match 40.0%; Score 551; DB 1; Length 632;
Best Local Similarity 99.8%; Pred. No. 1.7e-259;
Matches 601; Conservative 0; Mismatches 1; Indels 0; Gaps 0;



Qy 



705 CAATGCC^GTGTGAAGGTGAAGGGGCCTGCCC^TA^ 764 

MMM MM I II .III II I III I II: I II I IM ! Ill : II 1 1 II I II Ml 1 1 Ml 



Db 




/"17V A OVT'P A PTVTV A A PHTVA A PPPPPPTPPPPA TA AP 1 A ' I " 1 u 1 1 /~"T" /^"TV" 1 A P 1 f""T l /'"V~ 1 A P*A P 1 A A A 

CAA 1 XjLLALj I \j IvJAAuG IviAACjGGOCCTGCCCA I AAUA I Fl L rC IXjGALjC 1 CGACAbAAA 




Qy 


765 


GATCATGGATGCCAATTACTTTGGCCCGA^ 


824 




IMIIIMIMIIIIIIMMIIIIIIIIIIIIMIIMMMIIIIIIIIIIIIIIII! 




UD 




P 1 A TP 1 A T^V A r PP , P t P t A A TTA prpmmpppppp t\ rp/^A /~*A r I " I V — * A PV 1 A A APPPPTP pmmpppTi A P 1 A T 

UA I CA I UtiA 1 VjCCAA I lACi 1 JAjIjCCCCAI CACA1 IVxACVjAAAOCCCiVjCI ICCCAACAi 


4 y b 


Qy 


825 


GATCTCCCGGAGAACAGGCCAAATCGTGTTAGTGAATAATA 


884 






M 1 1 1 1 M 1 1 II 1 1 1 1 1 1 M 1 1 1 M 1 MMI 1 Ml 1 1 M 1 1 II 1 1 1 1 II 1 1 1 1 I 1 1 1 1 1 ! 




UD 


4 y o 


P 1 A r PP"T , P 1 P 1 PY~!P 1 A P* A A PAPPPPA A A r PPP f TV* 1 fTu p A PTV 1 A A T'A A ♦"PA T»PPA APPPA jiprrvpmp/i A A T 1 

bA 1 C 1 C C CCiLiALjAA CACO C CAAA I Clj l\j 1 1 AL> 1 vjAA 1 AA i A 1 C CAAUOCAAu 1 1 luCJAA 1 


/IOC 

4 jo 


Qy 


885 


CCCGTTCCGTAQ^CTTACGCTGCCTC 


944 






i i i ; i i Mi' 




nK 
UD 


A 1 tZ 

4 jb 


CCCG J. 1 CCG 1 ACviACTTACGCTGCCTCCAAGGAl^ 


376 


Qy 


945 


CCGAGCCGAAGTGGAGGAATACGATGTTGTCATCAGC^ 


1004 






1 II II 1 1 1 II 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 II II ! II 1 1 1 II 1 1 1 1 1 II Ml 1 II II II 1 1 1 




nh 

UD 


J /o 


P 1 P 1 P 1 A P 1 P'P'P 1 A APTPPA P'P* A A T'A PP A mn'IT'PT'P A TP* A P'P 1 A P^PV^TV" A P" P«P , /""V~' A /"*H "T"PPA TZ-ipV"* 

CCC^LjCCUAAvj IXjKjAGUAA 1 ACUA L\j 1 L\j 1 CA X CAGCACCG IXjAGCCCGACTTTCA J/CCG 


316 


Qy 


1005 


GTCGTACC^CGTGTATCCAGAGC7UVGGAAACTGGGAAGCTTCCATTTGGAAATTCTTTTT 


1064 






M 1 Ml 1 1 1 1 1 1 1 1 1 II 1 II II 1 III 1 1 MM 1 II 1 1 1 1 1 1 1 1 II II II 1 III 1 1 II 1 M 




r>K 
UD 


J Id 


GTCGTACCACGTGTATCCAGAG CAAGGAAACTGGGAAGCTTCCATTTG 


0 c r 

256 


Qy 


1065 


CAGGAAG CTGA CCTA CGG CGTG CAC C CAGTAGAGGTGG CGGAGGAGG TGATG CG CAC CGT 


1124 






III III Mill II II II MM Mill III II II MM II II II II II II III III Mill 




nK 
UD 




PAPPA A P 1 PTP A P" , /"" ,r P A C~*f~*r~* m/-> /~i a OO/^AOT'A/^'A P/'inv^P /— 71 /~i /~i 71 rrvi a rn/"i nrrii\ /"i f~\r~\ m 

LAbbAAbL 1 UAL LTA CGG CGTG CAC CCAGT 


196 


Qy 


1125 


GCGGAGGAAGAAGCAAGAGGTGTTTATGGCCAACCCC^TCCCCAAGGC 


1184 






IMMMIMMIMMMMMMMMMMMMMMMMMMMMMIMM 




nK 

UD 


1 QC 

±y b 


P* PVP 1 A pip 1 A A P 1 A APPA A P 1 A P'P* TV"!" 1 " 1 1 A T'PP'PPA A f~^f^ A rnnpripTi a /"*/"*•/""' /"'/""' /""* iTV"i rp A /~V""p 

vjCCjbALxGAAGAAGCAAGAGGTGTTTATGGCC^^ 


136 


Qy       1185 CCGCACCTTCTTCCCGGAGTTCTTTTTCGCCGTGGTGGCCTGTGGGGTGAAGGAGAAGCT 1244
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        135 CCGCACCTTCTTCCCGGAGTTCTTTTTCGCCGTGGTGGCCTGTGGGGTGAAGGAGAAGCT 76

Qy       1245 CAATGTCCCGGAGGAGGGGTAACTGCAGGAGGCCAAATGGGCCACCCCTTGGAAATAAAG 1304
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         75 CAATGTCCCGGAGGAGGGGTAACTGCAGGAGGCCAAATGGGCCACCCCTTGGAAATAAAG 16

Qy       1305 GT 1306
              ||
Db         15 GT 14
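The statistics header for RESULT 2 can be cross-checked against the alignment coordinates: with no indels or gaps, Matches plus Mismatches should equal the number of aligned query positions (Qy 705 to 1306). The following Python sketch parses the header and performs that check. The helper name `parse_stats` and the regular expression are illustrative assumptions, not part of the search software, and the header text is the RESULT 2 header with OCR spacing cleaned up.

```python
import re


def parse_stats(text):
    """Extract 'Name value;' pairs from an alignment statistics header.

    Returns a dict mapping field names to their values as strings.
    """
    pairs = re.findall(r"([A-Za-z][A-Za-z. ]*?)\s+([-+0-9.eE%]+)\s*;", text)
    return {name.strip(): value for name, value in pairs}


header = (
    "Query Match 40.0%; Score 551; DB 1; Length 632; "
    "Best Local Similarity 99.8%; Pred. No. 1.7e-259; "
    "Matches 601; Conservative 0; Mismatches 1; Indels 0; Gaps 0;"
)

stats = parse_stats(header)
aligned_columns = int(stats["Matches"]) + int(stats["Mismatches"])
query_span = 1306 - 705 + 1  # first and last Qy coordinates of RESULT 2

print(aligned_columns, query_span)
```

Both numbers come out equal for this hit, which is consistent with an ungapped alignment covering the whole reported query span.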





Database : N_Geneseq_16Dec04:*

1: geneseqn1980s:*
2: geneseqn1990s:*
3: geneseqn2000s:*
4: geneseqn2001as:*
5: geneseqn2001bs:*
6: geneseqn2002as:*
7: geneseqn2002bs:*
8: geneseqn2003as:*
9: geneseqn2003bs:*
10: geneseqn2003cs:*
11: geneseqn2003ds:*
12: geneseqn2004as:*
13: geneseqn2004bs:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

                   %
Result           Query
  No.   Score    Match  Length  DB  ID         Description

    1    1379    100.0    1379   4  AAH43561   Aah43561 Human DHD
    2    1379    100.0    1379  10  ABV76050   Abv76050 Human deh
    3    1379    100.0    1379  12  ADG40198   Adg40198 cDNA enco
    4    1004     72.8    1114   8  ABX71069   Abx71069 Novel hum
    5     955     69.3    1157   3  AAA39072   Aaa39072 Human sec
    6     936     67.9     936   4  AAH43565   Aah43565 Human DHD
    7     677     49.1     797  12  ADM33424   Adm33424 Human PRO
    8     327     23.7     780   4  AAH98810   Aah98810 Human EST
    9     208     15.1     247  12  ACH82344   Ach82344 Human gen
   10     208     15.1     514  12  ACH68644   Ach68644 Human gen
   11     159     11.5     599  12  ACH69248   Ach69248 Human gen
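The Query Match values in the Geneseq summary are numerically consistent with each hit's score expressed as a percentage of the top score (1379, the 100.0% hits). That relationship is an observation from the listed values, not a definition stated in the report. A short Python sketch under that assumption follows; the function name `query_match_percent` is hypothetical.

```python
def query_match_percent(score, perfect_score=1379):
    """Score as a percentage of the best (self-match) score, to one decimal.

    perfect_score defaults to 1379, the score of the 100.0% hits in the
    Geneseq summary; this reading of the column is an assumption.
    """
    return round(100.0 * score / perfect_score, 1)


# Scores taken from the Geneseq summary rows 1, 4, 5, 7 and 8
for score in (1379, 1004, 955, 677, 327):
    print(score, query_match_percent(score))
```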



Database : GenEmbl:*

1: gb_ba:*
2: gb_htg:*
3: gb_in:*
4: gb_om:*
5: gb_ov:*
6: gb_pat:*
7: gb_ph:*
8: gb_pl:*
9: gb_pr:*
10: gb_ro:*
11: gb_sts:*
12: gb_sy:*
13: gb_un:*
14: gb_vi:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

                   %
Result           Query
  No.   Score    Match  Length  DB  ID         Description

    1    1379    100.0    1379   6  AR278046   AR278046 Sequence
    2    1379    100.0    1379   6  AR392052   AR392052 Sequence
    3    1379    100.0    1379   6  AR403912   AR403912 Sequence
    4    1379    100.0    1379   6  AX320639   AX320639 Sequence
    5    1004     72.8    1114   6  AR542048   AR542048 Sequence
    6     955     69.3    1157   6  BD233739   BD233739 31 human
    7     933     67.7     933   6  AR278047   AR278047 Sequence
    8     933     67.7     933   6  AR392053   AR392053 Sequence
    9     933     67.7     933   6  AR403913   AR403913 Sequence
   10     933     67.7     933   6  AX320641   AX320641 Sequence
   11     932     67.6     932   6  CQ731016   CQ731016 Sequence
   12     451     32.7  188646   9  AC027045   AC027045 Homo sapi



Database : EST:*

1: gb_est1:*
2: gb_est2:*
3: gb_htc:*
4: gb_est3:*
5: gb_est4:*
6: gb_est5:*
7: gb_est6:*
8: gb_gss1:*
9: gb_gss2:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

                     %
  Result           Query
    No.   Score    Match  Length  DB  ID         Description

c     1     621     45.0     631   1  AI091419   AI091419 ow62e03.x
c     2     551     40.0     632   1  AI741629   AI741629 wg28f07.x
c     3     495     35.9     562   1  AI458236   AI458236 tj53e07.x
c     4     485     35.2     632   1  AI741640   AI741640 wg28g07.x
      5     478     34.7     483   9  AY421548   AY421548 Homo sapi
c     6     473     34.3     473   1  AI376903   AI376903 tc27f05.x
c     7     469     34.0     469   1  AI141463   AI141463 qa67d12.x
c     8     414     30.0     558   1  AI222126   AI222126 qh02g04.x
c     9     261     18.9     505   1  AA953672   AA953672 oo02e08.s
     10     257     18.6     309   7  F26544     F26544 HSPD14061 H
c    11     256     18.6     307   1  AI470361   AI470361 tj42f03.x
c    12     254     18.4     477   1  AI168267   AI168267 oo10c10.x
     13     248     18.0     267   9  AY421549   AY421549 Pan trogl
     14     247     17.9     274   7  F35823     F35823 HSPD32901 H
c    15     243     17.6     396   1  AA928254   AA928254 on79a08.s
c    16     222     16.1     480   1  AI022337   AI022337 ow95a11.x



